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Memory s true ture(array Jink list^tc) 
to store web cards data 



Hash table index entry for name search 
(64K Unicode Chars) 
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7 



a list of pointer, each points to 
a card whose name is '^i^tf 1 

Hash table index entry for Finyin and 
Tongyin search 





ri 




wang 




zhang 









a list of pointer, each points to a card whose 
name has the prounoucation 'li wei feng 



FIG 3A 



Memory stmcture(array4ink lisiyetc) 

to store web cards data 

(the same structure as Fig3A) 



Hash table index entry for fuzzy search 
(64K Unicode Chars) 
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a list of pointer, each points to a card 
whose name or additional infomation 
include the word ' @ ^ Ef^* 



User input query string A 



name search 



7^ 
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phonetic or homophony search 




Treat A as a Chinese chars string, 
then get it's phonetic spelling. 



homophony search 



Treat A directly as a phonetic 
spelling string. 



phonetic search 



/ Phonetic spelling A' 



Parse the Hash table index tree(the upper 
one in Fig3A) to locate a node Na. The 
path from the root of tree to Na exactly 
match string A. 



2 



Parse the Hash table index tree(the lower 
one in Fi^3A) to locate a node Nb. The 
path from the root of tree to Nb will have 
a phonetic spelling matching string A'. 



Get the pointer list of node Na as the 
result Ra, Ra then will contain all the 
cards whose name exactly match A. 
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Result Ra 



7^ 
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Get the pointer list of node Nb as the 
result Rb, Rb then will contain all the 
cards whose name's phonetic spelling 
exactly match A. 



L 
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Result Rb 



Combine set Ra and Rb to get the final 
result R. 
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Final result R 
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Self-learning Chinese 
word data 
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opm t\ to one 01 more ^rurtese woras: 
W={W 1 J( „.,W B > 



± 



For each woid W x in W, parse the Hash table 
index tree in Fig3B to locate a node N x . The 
path from the root of tree to N exactly match 
wndW.. 



Get the pointer list of node as the result 
lii R r 1 x then will contain all the cards 
whose name or additional information include 
the word W_. 



J 



Get the union of R,* H : 
R=EjU RjO....R H 



± 
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While doing the unoin, calculate the weight of each result in R according to 
specified rules,such as: 

(1) weight of count: the number of words within W that the result contains. 

(2) weight of length: the total length of words within W that the result contains. 

(3) weight of word attribute: assign each word attribute (noun, verb, adj.,etc) a 
individual weight, then we can get a comprehensive weight. 

Finally, calculate the comprehensive weight of each result based on the above 
rules. 



Sort the result list R according to weight of J ^ < R~7 
result, so as the most approximate result s / ^ 



appear at head of the list, and limit the 
number of result in the list. 

/ Final result list R 
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FIG 4B 



